Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 3333 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 524.2 KiB |
| Average record size in memory | 161.0 B |
Variable types
| Categorical | 3 |
|---|---|
| Numeric | 15 |
| Boolean | 3 |
state has a high cardinality: 51 distinct values | High cardinality |
phone number has a high cardinality: 3333 distinct values | High cardinality |
number vmail messages is highly overall correlated with voice mail plan | High correlation |
total day minutes is highly overall correlated with total day charge | High correlation |
total day charge is highly overall correlated with total day minutes | High correlation |
total eve minutes is highly overall correlated with total eve charge | High correlation |
total eve charge is highly overall correlated with total eve minutes | High correlation |
total night minutes is highly overall correlated with total night charge | High correlation |
total night charge is highly overall correlated with total night minutes | High correlation |
total intl minutes is highly overall correlated with total intl charge | High correlation |
total intl charge is highly overall correlated with total intl minutes | High correlation |
voice mail plan is highly overall correlated with number vmail messages | High correlation |
phone number is uniformly distributed | Uniform |
phone number has unique values | Unique |
number vmail messages has 2411 (72.3%) zeros | Zeros |
customer service calls has 697 (20.9%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-27 06:14:12.664054 |
|---|---|
| Analysis finished | 2022-11-27 06:15:06.237302 |
| Duration | 53.57 seconds |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
state
Categorical
| Distinct | 51 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| WV | 106 |
|---|---|
| MN | 84 |
| NY | 83 |
| AL | 80 |
| WI | 78 |
| Other values (46) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 6666 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | KS |
|---|---|
| 2nd row | OH |
| 3rd row | NJ |
| 4th row | OH |
| 5th row | OK |
Common Values
| Value | Count | Frequency (%) |
| WV | 106 | 3.2% |
| MN | 84 | 2.5% |
| NY | 83 | 2.5% |
| AL | 80 | 2.4% |
| WI | 78 | 2.3% |
| OH | 78 | 2.3% |
| OR | 78 | 2.3% |
| WY | 77 | 2.3% |
| VA | 77 | 2.3% |
| CT | 74 | 2.2% |
| Other values (41) | 2518 |
Length
| Value | Count | Frequency (%) |
| wv | 106 | 3.2% |
| mn | 84 | 2.5% |
| ny | 83 | 2.5% |
| al | 80 | 2.4% |
| wi | 78 | 2.3% |
| oh | 78 | 2.3% |
| or | 78 | 2.3% |
| wy | 77 | 2.3% |
| va | 77 | 2.3% |
| ct | 74 | 2.2% |
| Other values (41) | 2518 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 734 | 11.0% |
| A | 687 | 10.3% |
| M | 612 | 9.2% |
| I | 515 | 7.7% |
| T | 412 | 6.2% |
| D | 380 | 5.7% |
| C | 356 | 5.3% |
| O | 346 | 5.2% |
| W | 327 | 4.9% |
| V | 322 | 4.8% |
| Other values (14) | 1975 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6666 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 734 | 11.0% |
| A | 687 | 10.3% |
| M | 612 | 9.2% |
| I | 515 | 7.7% |
| T | 412 | 6.2% |
| D | 380 | 5.7% |
| C | 356 | 5.3% |
| O | 346 | 5.2% |
| W | 327 | 4.9% |
| V | 322 | 4.8% |
| Other values (14) | 1975 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6666 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 734 | 11.0% |
| A | 687 | 10.3% |
| M | 612 | 9.2% |
| I | 515 | 7.7% |
| T | 412 | 6.2% |
| D | 380 | 5.7% |
| C | 356 | 5.3% |
| O | 346 | 5.2% |
| W | 327 | 4.9% |
| V | 322 | 4.8% |
| Other values (14) | 1975 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6666 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 734 | 11.0% |
| A | 687 | 10.3% |
| M | 612 | 9.2% |
| I | 515 | 7.7% |
| T | 412 | 6.2% |
| D | 380 | 5.7% |
| C | 356 | 5.3% |
| O | 346 | 5.2% |
| W | 327 | 4.9% |
| V | 322 | 4.8% |
| Other values (14) | 1975 |
account length
Real number (ℝ)
| Distinct | 212 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 101.06481 |
| Minimum | 1 |
|---|---|
| Maximum | 243 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 74 |
| median | 101 |
| Q3 | 127 |
| 95-th percentile | 167 |
| Maximum | 243 |
| Range | 242 |
| Interquartile range (IQR) | 53 |
Descriptive statistics
| Standard deviation | 39.822106 |
|---|---|
| Coefficient of variation (CV) | 0.39402545 |
| Kurtosis | -0.10783598 |
| Mean | 101.06481 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.096606294 |
| Sum | 336849 |
| Variance | 1585.8001 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 43 | 1.3% |
| 87 | 42 | 1.3% |
| 101 | 40 | 1.2% |
| 93 | 40 | 1.2% |
| 90 | 39 | 1.2% |
| 95 | 38 | 1.1% |
| 86 | 38 | 1.1% |
| 100 | 37 | 1.1% |
| 116 | 37 | 1.1% |
| 112 | 36 | 1.1% |
| Other values (202) | 2943 |
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 2 | 1 | < 0.1% |
| 3 | 5 | |
| 4 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | 0.1% |
| 7 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 3 | 0.1% |
| 10 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 243 | 1 | < 0.1% |
| 232 | 1 | < 0.1% |
| 225 | 2 | |
| 224 | 2 | |
| 221 | 1 | < 0.1% |
| 217 | 2 | |
| 215 | 1 | < 0.1% |
| 212 | 2 | |
| 210 | 2 | |
| 209 | 3 |
area code
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| 415 | |
|---|---|
| 510 | |
| 408 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 9999 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 415 |
|---|---|
| 2nd row | 415 |
| 3rd row | 415 |
| 4th row | 408 |
| 5th row | 415 |
Common Values
| Value | Count | Frequency (%) |
| 415 | 1655 | |
| 510 | 840 | |
| 408 | 838 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 415 | 1655 | |
| 510 | 840 | |
| 408 | 838 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2495 | |
| 5 | 2495 | |
| 4 | 2493 | |
| 0 | 1678 | |
| 8 | 838 | 8.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9999 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2495 | |
| 5 | 2495 | |
| 4 | 2493 | |
| 0 | 1678 | |
| 8 | 838 | 8.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9999 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2495 | |
| 5 | 2495 | |
| 4 | 2493 | |
| 0 | 1678 | |
| 8 | 838 | 8.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9999 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2495 | |
| 5 | 2495 | |
| 4 | 2493 | |
| 0 | 1678 | |
| 8 | 838 | 8.4% |
| Distinct | 3333 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.2 KiB |
| 382-4657 | 1 |
|---|---|
| 348-7071 | 1 |
| 389-6082 | 1 |
| 415-3689 | 1 |
| 379-2503 | 1 |
| Other values (3328) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 26664 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3333 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 382-4657 |
|---|---|
| 2nd row | 371-7191 |
| 3rd row | 358-1921 |
| 4th row | 375-9999 |
| 5th row | 330-6626 |
Common Values
| Value | Count | Frequency (%) |
| 382-4657 | 1 | < 0.1% |
| 348-7071 | 1 | < 0.1% |
| 389-6082 | 1 | < 0.1% |
| 415-3689 | 1 | < 0.1% |
| 379-2503 | 1 | < 0.1% |
| 396-1106 | 1 | < 0.1% |
| 379-4372 | 1 | < 0.1% |
| 336-3738 | 1 | < 0.1% |
| 380-2600 | 1 | < 0.1% |
| 345-4473 | 1 | < 0.1% |
| Other values (3323) | 3323 |
Length
| Value | Count | Frequency (%) |
| 382-4657 | 1 | < 0.1% |
| 407-7507 | 1 | < 0.1% |
| 363-1107 | 1 | < 0.1% |
| 358-1921 | 1 | < 0.1% |
| 375-9999 | 1 | < 0.1% |
| 330-6626 | 1 | < 0.1% |
| 391-8027 | 1 | < 0.1% |
| 355-9993 | 1 | < 0.1% |
| 329-9001 | 1 | < 0.1% |
| 335-4719 | 1 | < 0.1% |
| Other values (3323) | 3323 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 4626 | |
| - | 3333 | |
| 4 | 2820 | |
| 9 | 2090 | |
| 6 | 2070 | |
| 5 | 2050 | |
| 7 | 2037 | |
| 8 | 2005 | |
| 1 | 1979 | |
| 2 | 1891 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23331 | |
| Dash Punctuation | 3333 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 4626 | |
| 4 | 2820 | |
| 9 | 2090 | |
| 6 | 2070 | |
| 5 | 2050 | |
| 7 | 2037 | |
| 8 | 2005 | |
| 1 | 1979 | |
| 2 | 1891 | |
| 0 | 1763 | 7.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3333 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26664 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 4626 | |
| - | 3333 | |
| 4 | 2820 | |
| 9 | 2090 | |
| 6 | 2070 | |
| 5 | 2050 | |
| 7 | 2037 | |
| 8 | 2005 | |
| 1 | 1979 | |
| 2 | 1891 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26664 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 4626 | |
| - | 3333 | |
| 4 | 2820 | |
| 9 | 2090 | |
| 6 | 2070 | |
| 5 | 2050 | |
| 7 | 2037 | |
| 8 | 2005 | |
| 1 | 1979 | |
| 2 | 1891 |
international plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 3010 | |
| True | 323 | 9.7% |
voice mail plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 2411 | |
| True | 922 | 27.7% |
| Distinct | 46 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.0990099 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 2411 |
| Zeros (%) | 72.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 20 |
| 95-th percentile | 36 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.688365 |
|---|---|
| Coefficient of variation (CV) | 1.6901282 |
| Kurtosis | -0.051128539 |
| Mean | 8.0990099 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2648236 |
| Sum | 26994 |
| Variance | 187.37135 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2411 | |
| 31 | 60 | 1.8% |
| 29 | 53 | 1.6% |
| 28 | 51 | 1.5% |
| 33 | 46 | 1.4% |
| 27 | 44 | 1.3% |
| 30 | 44 | 1.3% |
| 24 | 42 | 1.3% |
| 26 | 41 | 1.2% |
| 32 | 41 | 1.2% |
| Other values (36) | 500 | 15.0% |
| Value | Count | Frequency (%) |
| 0 | 2411 | |
| 4 | 1 | < 0.1% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 2 | 0.1% |
| 12 | 6 | 0.2% |
| 13 | 4 | 0.1% |
| 14 | 7 | 0.2% |
| 15 | 9 | 0.3% |
| Value | Count | Frequency (%) |
| 51 | 1 | < 0.1% |
| 50 | 2 | 0.1% |
| 49 | 1 | < 0.1% |
| 48 | 2 | 0.1% |
| 47 | 3 | 0.1% |
| 46 | 4 | 0.1% |
| 45 | 6 | 0.2% |
| 44 | 7 | |
| 43 | 9 | |
| 42 | 15 |
total day minutes
Real number (ℝ)
| Distinct | 1667 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179.7751 |
| Minimum | 0 |
|---|---|
| Maximum | 350.8 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 89.92 |
| Q1 | 143.7 |
| median | 179.4 |
| Q3 | 216.4 |
| 95-th percentile | 270.74 |
| Maximum | 350.8 |
| Range | 350.8 |
| Interquartile range (IQR) | 72.7 |
Descriptive statistics
| Standard deviation | 54.467389 |
|---|---|
| Coefficient of variation (CV) | 0.30297516 |
| Kurtosis | -0.019940379 |
| Mean | 179.7751 |
| Median Absolute Deviation (MAD) | 36.3 |
| Skewness | -0.029077067 |
| Sum | 599190.4 |
| Variance | 2966.6965 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 154 | 8 | 0.2% |
| 159.5 | 8 | 0.2% |
| 174.5 | 8 | 0.2% |
| 183.4 | 7 | 0.2% |
| 175.4 | 7 | 0.2% |
| 162.3 | 7 | 0.2% |
| 178.7 | 6 | 0.2% |
| 194.8 | 6 | 0.2% |
| 189.3 | 6 | 0.2% |
| 146.3 | 6 | 0.2% |
| Other values (1657) | 3264 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 2.6 | 1 | |
| 7.8 | 1 | |
| 7.9 | 1 | |
| 12.5 | 1 | |
| 17.6 | 1 | |
| 18.9 | 1 | |
| 19.5 | 1 | |
| 25.9 | 1 | |
| 27 | 1 |
| Value | Count | Frequency (%) |
| 350.8 | 1 | |
| 346.8 | 1 | |
| 345.3 | 1 | |
| 337.4 | 1 | |
| 335.5 | 1 | |
| 334.3 | 1 | |
| 332.9 | 1 | |
| 329.8 | 1 | |
| 328.1 | 1 | |
| 326.5 | 1 |
total day calls
Real number (ℝ)
| Distinct | 119 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.43564 |
| Minimum | 0 |
|---|---|
| Maximum | 165 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 101 |
| Q3 | 114 |
| 95-th percentile | 133 |
| Maximum | 165 |
| Range | 165 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 20.069084 |
|---|---|
| Coefficient of variation (CV) | 0.19982034 |
| Kurtosis | 0.24318152 |
| Mean | 100.43564 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.11178664 |
| Sum | 334752 |
| Variance | 402.76814 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 102 | 78 | 2.3% |
| 105 | 75 | 2.3% |
| 95 | 69 | 2.1% |
| 107 | 69 | 2.1% |
| 104 | 68 | 2.0% |
| 108 | 67 | 2.0% |
| 97 | 67 | 2.0% |
| 106 | 66 | 2.0% |
| 112 | 66 | 2.0% |
| 110 | 66 | 2.0% |
| Other values (109) | 2642 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 30 | 1 | < 0.1% |
| 35 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 40 | 2 | |
| 42 | 2 | |
| 44 | 3 | |
| 45 | 3 | |
| 47 | 2 | |
| 48 | 3 |
| Value | Count | Frequency (%) |
| 165 | 1 | < 0.1% |
| 163 | 1 | < 0.1% |
| 160 | 1 | < 0.1% |
| 158 | 3 | |
| 157 | 1 | < 0.1% |
| 156 | 1 | < 0.1% |
| 152 | 1 | < 0.1% |
| 151 | 5 | |
| 150 | 6 | |
| 149 | 1 | < 0.1% |
total day charge
Real number (ℝ)
| Distinct | 1667 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.562307 |
| Minimum | 0 |
|---|---|
| Maximum | 59.64 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 15.288 |
| Q1 | 24.43 |
| median | 30.5 |
| Q3 | 36.79 |
| 95-th percentile | 46.028 |
| Maximum | 59.64 |
| Range | 59.64 |
| Interquartile range (IQR) | 12.36 |
Descriptive statistics
| Standard deviation | 9.2594346 |
|---|---|
| Coefficient of variation (CV) | 0.30296909 |
| Kurtosis | -0.019811787 |
| Mean | 30.562307 |
| Median Absolute Deviation (MAD) | 6.17 |
| Skewness | -0.029083268 |
| Sum | 101864.17 |
| Variance | 85.737128 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.18 | 8 | 0.2% |
| 27.12 | 8 | 0.2% |
| 29.67 | 8 | 0.2% |
| 31.18 | 7 | 0.2% |
| 29.82 | 7 | 0.2% |
| 27.59 | 7 | 0.2% |
| 30.38 | 6 | 0.2% |
| 33.12 | 6 | 0.2% |
| 32.18 | 6 | 0.2% |
| 24.87 | 6 | 0.2% |
| Other values (1657) | 3264 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 0.44 | 1 | |
| 1.33 | 1 | |
| 1.34 | 1 | |
| 2.13 | 1 | |
| 2.99 | 1 | |
| 3.21 | 1 | |
| 3.32 | 1 | |
| 4.4 | 1 | |
| 4.59 | 1 |
| Value | Count | Frequency (%) |
| 59.64 | 1 | |
| 58.96 | 1 | |
| 58.7 | 1 | |
| 57.36 | 1 | |
| 57.04 | 1 | |
| 56.83 | 1 | |
| 56.59 | 1 | |
| 56.07 | 1 | |
| 55.78 | 1 | |
| 55.51 | 1 |
total eve minutes
Real number (ℝ)
| Distinct | 1611 |
|---|---|
| Distinct (%) | 48.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.98035 |
| Minimum | 0 |
|---|---|
| Maximum | 363.7 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 118.8 |
| Q1 | 166.6 |
| median | 201.4 |
| Q3 | 235.3 |
| 95-th percentile | 284.3 |
| Maximum | 363.7 |
| Range | 363.7 |
| Interquartile range (IQR) | 68.7 |
Descriptive statistics
| Standard deviation | 50.713844 |
|---|---|
| Coefficient of variation (CV) | 0.25233235 |
| Kurtosis | 0.025629753 |
| Mean | 200.98035 |
| Median Absolute Deviation (MAD) | 34.4 |
| Skewness | -0.023877456 |
| Sum | 669867.5 |
| Variance | 2571.894 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 169.9 | 9 | 0.3% |
| 167.2 | 7 | 0.2% |
| 180.5 | 7 | 0.2% |
| 201 | 7 | 0.2% |
| 161.7 | 7 | 0.2% |
| 209.4 | 7 | 0.2% |
| 230.9 | 7 | 0.2% |
| 220.6 | 7 | 0.2% |
| 195.5 | 7 | 0.2% |
| 230 | 6 | 0.2% |
| Other values (1601) | 3262 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 31.2 | 1 | |
| 42.2 | 1 | |
| 42.5 | 1 | |
| 43.9 | 1 | |
| 48.1 | 1 | |
| 49.2 | 1 | |
| 52.9 | 1 | |
| 56 | 1 | |
| 58.6 | 1 |
| Value | Count | Frequency (%) |
| 363.7 | 1 | |
| 361.8 | 1 | |
| 354.2 | 1 | |
| 351.6 | 1 | |
| 350.9 | 1 | |
| 350.5 | 1 | |
| 348.5 | 1 | |
| 347.3 | 1 | |
| 341.3 | 1 | |
| 339.9 | 1 |
total eve calls
Real number (ℝ)
| Distinct | 123 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.11431 |
| Minimum | 0 |
|---|---|
| Maximum | 170 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 100 |
| Q3 | 114 |
| 95-th percentile | 133 |
| Maximum | 170 |
| Range | 170 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 19.922625 |
|---|---|
| Coefficient of variation (CV) | 0.19899877 |
| Kurtosis | 0.20615647 |
| Mean | 100.11431 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.055563139 |
| Sum | 333681 |
| Variance | 396.911 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 80 | 2.4% |
| 94 | 79 | 2.4% |
| 108 | 71 | 2.1% |
| 102 | 70 | 2.1% |
| 97 | 70 | 2.1% |
| 88 | 69 | 2.1% |
| 101 | 68 | 2.0% |
| 109 | 67 | 2.0% |
| 98 | 66 | 2.0% |
| 111 | 65 | 2.0% |
| Other values (113) | 2628 |
| Value | Count | Frequency (%) |
| 0 | 1 | < 0.1% |
| 12 | 1 | < 0.1% |
| 36 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 45 | 1 | < 0.1% |
| 46 | 3 | |
| 48 | 6 |
| Value | Count | Frequency (%) |
| 170 | 1 | < 0.1% |
| 168 | 1 | < 0.1% |
| 164 | 1 | < 0.1% |
| 159 | 1 | < 0.1% |
| 157 | 1 | < 0.1% |
| 156 | 1 | < 0.1% |
| 155 | 3 | |
| 154 | 2 | 0.1% |
| 153 | 1 | < 0.1% |
| 152 | 6 |
total eve charge
Real number (ℝ)
| Distinct | 1440 |
|---|---|
| Distinct (%) | 43.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.08354 |
| Minimum | 0 |
|---|---|
| Maximum | 30.91 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.1 |
| Q1 | 14.16 |
| median | 17.12 |
| Q3 | 20 |
| 95-th percentile | 24.17 |
| Maximum | 30.91 |
| Range | 30.91 |
| Interquartile range (IQR) | 5.84 |
Descriptive statistics
| Standard deviation | 4.3106676 |
|---|---|
| Coefficient of variation (CV) | 0.25232871 |
| Kurtosis | 0.025487405 |
| Mean | 17.08354 |
| Median Absolute Deviation (MAD) | 2.92 |
| Skewness | -0.023857989 |
| Sum | 56939.44 |
| Variance | 18.581856 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14.25 | 11 | 0.3% |
| 16.12 | 11 | 0.3% |
| 15.9 | 10 | 0.3% |
| 17.09 | 9 | 0.3% |
| 18.62 | 9 | 0.3% |
| 17.99 | 9 | 0.3% |
| 14.44 | 9 | 0.3% |
| 18.96 | 8 | 0.2% |
| 16.35 | 8 | 0.2% |
| 16.97 | 8 | 0.2% |
| Other values (1430) | 3241 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 2.65 | 1 | |
| 3.59 | 1 | |
| 3.61 | 1 | |
| 3.73 | 1 | |
| 4.09 | 1 | |
| 4.18 | 1 | |
| 4.5 | 1 | |
| 4.76 | 1 | |
| 4.98 | 1 |
| Value | Count | Frequency (%) |
| 30.91 | 1 | |
| 30.75 | 1 | |
| 30.11 | 1 | |
| 29.89 | 1 | |
| 29.83 | 1 | |
| 29.79 | 1 | |
| 29.62 | 1 | |
| 29.52 | 1 | |
| 29.01 | 1 | |
| 28.89 | 1 |
total night minutes
Real number (ℝ)
| Distinct | 1591 |
|---|---|
| Distinct (%) | 47.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.87204 |
| Minimum | 23.2 |
|---|---|
| Maximum | 395 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 23.2 |
|---|---|
| 5-th percentile | 118.18 |
| Q1 | 167 |
| median | 201.2 |
| Q3 | 235.3 |
| 95-th percentile | 282.84 |
| Maximum | 395 |
| Range | 371.8 |
| Interquartile range (IQR) | 68.3 |
Descriptive statistics
| Standard deviation | 50.573847 |
|---|---|
| Coefficient of variation (CV) | 0.25177146 |
| Kurtosis | 0.085816078 |
| Mean | 200.87204 |
| Median Absolute Deviation (MAD) | 34.2 |
| Skewness | 0.0089212911 |
| Sum | 669506.5 |
| Variance | 2557.714 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 191.4 | 8 | 0.2% |
| 210 | 8 | 0.2% |
| 188.2 | 8 | 0.2% |
| 197.4 | 8 | 0.2% |
| 214.6 | 8 | 0.2% |
| 193.6 | 7 | 0.2% |
| 206.1 | 7 | 0.2% |
| 194.3 | 7 | 0.2% |
| 214.7 | 7 | 0.2% |
| 231.5 | 7 | 0.2% |
| Other values (1581) | 3258 |
| Value | Count | Frequency (%) |
| 23.2 | 1 | |
| 43.7 | 1 | |
| 45 | 1 | |
| 47.4 | 1 | |
| 50.1 | 2 | |
| 53.3 | 1 | |
| 54 | 1 | |
| 54.5 | 1 | |
| 56.6 | 1 | |
| 57.5 | 1 |
| Value | Count | Frequency (%) |
| 395 | 1 | |
| 381.9 | 1 | |
| 377.5 | 1 | |
| 367.7 | 1 | |
| 364.9 | 1 | |
| 364.3 | 1 | |
| 354.9 | 1 | |
| 352.5 | 1 | |
| 352.2 | 1 | |
| 350.2 | 1 |
total night calls
Real number (ℝ)
| Distinct | 120 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.10771 |
| Minimum | 33 |
|---|---|
| Maximum | 175 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 68 |
| Q1 | 87 |
| median | 100 |
| Q3 | 113 |
| 95-th percentile | 132 |
| Maximum | 175 |
| Range | 142 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.568609 |
|---|---|
| Coefficient of variation (CV) | 0.19547555 |
| Kurtosis | -0.072019579 |
| Mean | 100.10771 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.03249957 |
| Sum | 333659 |
| Variance | 382.93047 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 84 | 2.5% |
| 104 | 78 | 2.3% |
| 91 | 76 | 2.3% |
| 102 | 72 | 2.2% |
| 100 | 69 | 2.1% |
| 106 | 69 | 2.1% |
| 98 | 67 | 2.0% |
| 94 | 66 | 2.0% |
| 103 | 65 | 2.0% |
| 95 | 64 | 1.9% |
| Other values (110) | 2623 |
| Value | Count | Frequency (%) |
| 33 | 1 | |
| 36 | 1 | |
| 38 | 1 | |
| 42 | 2 | |
| 44 | 1 | |
| 46 | 1 | |
| 48 | 1 | |
| 49 | 2 | |
| 50 | 2 | |
| 51 | 2 |
| Value | Count | Frequency (%) |
| 175 | 1 | < 0.1% |
| 166 | 1 | < 0.1% |
| 164 | 1 | < 0.1% |
| 158 | 1 | < 0.1% |
| 157 | 2 | |
| 156 | 2 | |
| 155 | 2 | |
| 154 | 2 | |
| 153 | 3 | |
| 152 | 3 |
total night charge
Real number (ℝ)
| Distinct | 933 |
|---|---|
| Distinct (%) | 28.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0393249 |
| Minimum | 1.04 |
|---|---|
| Maximum | 17.77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 1.04 |
|---|---|
| 5-th percentile | 5.316 |
| Q1 | 7.52 |
| median | 9.05 |
| Q3 | 10.59 |
| 95-th percentile | 12.73 |
| Maximum | 17.77 |
| Range | 16.73 |
| Interquartile range (IQR) | 3.07 |
Descriptive statistics
| Standard deviation | 2.2758728 |
|---|---|
| Coefficient of variation (CV) | 0.25177465 |
| Kurtosis | 0.08566318 |
| Mean | 9.0393249 |
| Median Absolute Deviation (MAD) | 1.54 |
| Skewness | 0.0088862368 |
| Sum | 30128.07 |
| Variance | 5.1795972 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9.66 | 15 | 0.5% |
| 9.45 | 15 | 0.5% |
| 8.47 | 14 | 0.4% |
| 8.88 | 14 | 0.4% |
| 7.69 | 13 | 0.4% |
| 8.64 | 12 | 0.4% |
| 10.8 | 11 | 0.3% |
| 10.49 | 11 | 0.3% |
| 10.35 | 11 | 0.3% |
| 8.57 | 11 | 0.3% |
| Other values (923) | 3206 |
| Value | Count | Frequency (%) |
| 1.04 | 1 | |
| 1.97 | 1 | |
| 2.03 | 1 | |
| 2.13 | 1 | |
| 2.25 | 2 | |
| 2.4 | 1 | |
| 2.43 | 1 | |
| 2.45 | 1 | |
| 2.55 | 1 | |
| 2.59 | 1 |
| Value | Count | Frequency (%) |
| 17.77 | 1 | |
| 17.19 | 1 | |
| 16.99 | 1 | |
| 16.55 | 1 | |
| 16.42 | 1 | |
| 16.39 | 1 | |
| 15.97 | 1 | |
| 15.86 | 1 | |
| 15.85 | 1 | |
| 15.76 | 1 |
total intl minutes
Real number (ℝ)
| Distinct | 162 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.237294 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.7 |
| Q1 | 8.5 |
| median | 10.3 |
| Q3 | 12.1 |
| 95-th percentile | 14.7 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3.6 |
Descriptive statistics
| Standard deviation | 2.7918395 |
|---|---|
| Coefficient of variation (CV) | 0.27271265 |
| Kurtosis | 0.60918476 |
| Mean | 10.237294 |
| Median Absolute Deviation (MAD) | 1.8 |
| Skewness | -0.24513594 |
| Sum | 34120.9 |
| Variance | 7.7943681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 62 | 1.9% |
| 11.3 | 59 | 1.8% |
| 9.8 | 56 | 1.7% |
| 10.9 | 56 | 1.7% |
| 10.1 | 53 | 1.6% |
| 10.6 | 53 | 1.6% |
| 10.2 | 53 | 1.6% |
| 11 | 52 | 1.6% |
| 11.1 | 52 | 1.6% |
| 9.7 | 51 | 1.5% |
| Other values (152) | 2786 |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 1.1 | 1 | < 0.1% |
| 1.3 | 1 | < 0.1% |
| 2 | 2 | 0.1% |
| 2.1 | 2 | 0.1% |
| 2.2 | 1 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 2.6 | 1 | < 0.1% |
| 2.7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 18.9 | 1 | < 0.1% |
| 18.4 | 1 | < 0.1% |
| 18.3 | 1 | < 0.1% |
| 18.2 | 2 | |
| 18 | 3 | |
| 17.9 | 1 | < 0.1% |
| 17.8 | 2 | |
| 17.6 | 2 | |
| 17.5 | 3 |
total intl calls
Real number (ℝ)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4794479 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.4612143 |
|---|---|
| Coefficient of variation (CV) | 0.54944589 |
| Kurtosis | 3.083589 |
| Mean | 4.4794479 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3214782 |
| Sum | 14930 |
| Variance | 6.0575757 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 668 | |
| 4 | 619 | |
| 2 | 489 | |
| 5 | 472 | |
| 6 | 336 | |
| 7 | 218 | 6.5% |
| 1 | 160 | 4.8% |
| 8 | 116 | 3.5% |
| 9 | 109 | 3.3% |
| 10 | 50 | 1.5% |
| Other values (11) | 96 | 2.9% |
| Value | Count | Frequency (%) |
| 0 | 18 | 0.5% |
| 1 | 160 | 4.8% |
| 2 | 489 | |
| 3 | 668 | |
| 4 | 619 | |
| 5 | 472 | |
| 6 | 336 | |
| 7 | 218 | 6.5% |
| 8 | 116 | 3.5% |
| 9 | 109 | 3.3% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 18 | 3 | 0.1% |
| 17 | 1 | < 0.1% |
| 16 | 2 | 0.1% |
| 15 | 7 | 0.2% |
| 14 | 6 | 0.2% |
| 13 | 14 | |
| 12 | 15 | |
| 11 | 28 |
total intl charge
Real number (ℝ)
| Distinct | 162 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7645815 |
| Minimum | 0 |
|---|---|
| Maximum | 5.4 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.54 |
| Q1 | 2.3 |
| median | 2.78 |
| Q3 | 3.27 |
| 95-th percentile | 3.97 |
| Maximum | 5.4 |
| Range | 5.4 |
| Interquartile range (IQR) | 0.97 |
Descriptive statistics
| Standard deviation | 0.75377261 |
|---|---|
| Coefficient of variation (CV) | 0.27265343 |
| Kurtosis | 0.60961043 |
| Mean | 2.7645815 |
| Median Absolute Deviation (MAD) | 0.48 |
| Skewness | -0.24528651 |
| Sum | 9214.35 |
| Variance | 0.56817315 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.7 | 62 | 1.9% |
| 3.05 | 59 | 1.8% |
| 2.65 | 56 | 1.7% |
| 2.94 | 56 | 1.7% |
| 2.73 | 53 | 1.6% |
| 2.86 | 53 | 1.6% |
| 2.75 | 53 | 1.6% |
| 2.97 | 52 | 1.6% |
| 3 | 52 | 1.6% |
| 2.62 | 51 | 1.5% |
| Other values (152) | 2786 |
| Value | Count | Frequency (%) |
| 0 | 18 | |
| 0.3 | 1 | < 0.1% |
| 0.35 | 1 | < 0.1% |
| 0.54 | 2 | 0.1% |
| 0.57 | 2 | 0.1% |
| 0.59 | 1 | < 0.1% |
| 0.65 | 1 | < 0.1% |
| 0.68 | 1 | < 0.1% |
| 0.7 | 1 | < 0.1% |
| 0.73 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5.4 | 1 | < 0.1% |
| 5.1 | 1 | < 0.1% |
| 4.97 | 1 | < 0.1% |
| 4.94 | 1 | < 0.1% |
| 4.91 | 2 | |
| 4.86 | 3 | |
| 4.83 | 1 | < 0.1% |
| 4.81 | 2 | |
| 4.75 | 2 | |
| 4.73 | 3 |
customer service calls
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5628563 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 697 |
| Zeros (%) | 20.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.315491 |
|---|---|
| Coefficient of variation (CV) | 0.84172234 |
| Kurtosis | 1.7309137 |
| Mean | 1.5628563 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0913595 |
| Sum | 5209 |
| Variance | 1.7305167 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1181 | |
| 2 | 759 | |
| 0 | 697 | |
| 3 | 429 | 12.9% |
| 4 | 166 | 5.0% |
| 5 | 66 | 2.0% |
| 6 | 22 | 0.7% |
| 7 | 9 | 0.3% |
| 9 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 697 | |
| 1 | 1181 | |
| 2 | 759 | |
| 3 | 429 | 12.9% |
| 4 | 166 | 5.0% |
| 5 | 66 | 2.0% |
| 6 | 22 | 0.7% |
| 7 | 9 | 0.3% |
| 8 | 2 | 0.1% |
| 9 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | 0.1% |
| 8 | 2 | 0.1% |
| 7 | 9 | 0.3% |
| 6 | 22 | 0.7% |
| 5 | 66 | 2.0% |
| 4 | 166 | 5.0% |
| 3 | 429 | 12.9% |
| 2 | 759 | |
| 1 | 1181 | |
| 0 | 697 |
churn
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 2850 | |
| True | 483 | 14.5% |
Auto
The auto setting is an interpretable pairwise column metric of the following mapping:- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.| state | account length | area code | phone number | international plan | voice mail plan | number vmail messages | total day minutes | total day calls | total day charge | total eve minutes | total eve calls | total eve charge | total night minutes | total night calls | total night charge | total intl minutes | total intl calls | total intl charge | customer service calls | churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | KS | 128 | 415 | 382-4657 | no | yes | 25 | 265.1 | 110 | 45.07 | 197.4 | 99 | 16.78 | 244.7 | 91 | 11.01 | 10.0 | 3 | 2.70 | 1 | False |
| 1 | OH | 107 | 415 | 371-7191 | no | yes | 26 | 161.6 | 123 | 27.47 | 195.5 | 103 | 16.62 | 254.4 | 103 | 11.45 | 13.7 | 3 | 3.70 | 1 | False |
| 2 | NJ | 137 | 415 | 358-1921 | no | no | 0 | 243.4 | 114 | 41.38 | 121.2 | 110 | 10.30 | 162.6 | 104 | 7.32 | 12.2 | 5 | 3.29 | 0 | False |
| 3 | OH | 84 | 408 | 375-9999 | yes | no | 0 | 299.4 | 71 | 50.90 | 61.9 | 88 | 5.26 | 196.9 | 89 | 8.86 | 6.6 | 7 | 1.78 | 2 | False |
| 4 | OK | 75 | 415 | 330-6626 | yes | no | 0 | 166.7 | 113 | 28.34 | 148.3 | 122 | 12.61 | 186.9 | 121 | 8.41 | 10.1 | 3 | 2.73 | 3 | False |
| 5 | AL | 118 | 510 | 391-8027 | yes | no | 0 | 223.4 | 98 | 37.98 | 220.6 | 101 | 18.75 | 203.9 | 118 | 9.18 | 6.3 | 6 | 1.70 | 0 | False |
| 6 | MA | 121 | 510 | 355-9993 | no | yes | 24 | 218.2 | 88 | 37.09 | 348.5 | 108 | 29.62 | 212.6 | 118 | 9.57 | 7.5 | 7 | 2.03 | 3 | False |
| 7 | MO | 147 | 415 | 329-9001 | yes | no | 0 | 157.0 | 79 | 26.69 | 103.1 | 94 | 8.76 | 211.8 | 96 | 9.53 | 7.1 | 6 | 1.92 | 0 | False |
| 8 | LA | 117 | 408 | 335-4719 | no | no | 0 | 184.5 | 97 | 31.37 | 351.6 | 80 | 29.89 | 215.8 | 90 | 9.71 | 8.7 | 4 | 2.35 | 1 | False |
| 9 | WV | 141 | 415 | 330-8173 | yes | yes | 37 | 258.6 | 84 | 43.96 | 222.0 | 111 | 18.87 | 326.4 | 97 | 14.69 | 11.2 | 5 | 3.02 | 0 | False |
| state | account length | area code | phone number | international plan | voice mail plan | number vmail messages | total day minutes | total day calls | total day charge | total eve minutes | total eve calls | total eve charge | total night minutes | total night calls | total night charge | total intl minutes | total intl calls | total intl charge | customer service calls | churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3323 | IN | 117 | 415 | 362-5899 | no | no | 0 | 118.4 | 126 | 20.13 | 249.3 | 97 | 21.19 | 227.0 | 56 | 10.22 | 13.6 | 3 | 3.67 | 5 | True |
| 3324 | WV | 159 | 415 | 377-1164 | no | no | 0 | 169.8 | 114 | 28.87 | 197.7 | 105 | 16.80 | 193.7 | 82 | 8.72 | 11.6 | 4 | 3.13 | 1 | False |
| 3325 | OH | 78 | 408 | 368-8555 | no | no | 0 | 193.4 | 99 | 32.88 | 116.9 | 88 | 9.94 | 243.3 | 109 | 10.95 | 9.3 | 4 | 2.51 | 2 | False |
| 3326 | OH | 96 | 415 | 347-6812 | no | no | 0 | 106.6 | 128 | 18.12 | 284.8 | 87 | 24.21 | 178.9 | 92 | 8.05 | 14.9 | 7 | 4.02 | 1 | False |
| 3327 | SC | 79 | 415 | 348-3830 | no | no | 0 | 134.7 | 98 | 22.90 | 189.7 | 68 | 16.12 | 221.4 | 128 | 9.96 | 11.8 | 5 | 3.19 | 2 | False |
| 3328 | AZ | 192 | 415 | 414-4276 | no | yes | 36 | 156.2 | 77 | 26.55 | 215.5 | 126 | 18.32 | 279.1 | 83 | 12.56 | 9.9 | 6 | 2.67 | 2 | False |
| 3329 | WV | 68 | 415 | 370-3271 | no | no | 0 | 231.1 | 57 | 39.29 | 153.4 | 55 | 13.04 | 191.3 | 123 | 8.61 | 9.6 | 4 | 2.59 | 3 | False |
| 3330 | RI | 28 | 510 | 328-8230 | no | no | 0 | 180.8 | 109 | 30.74 | 288.8 | 58 | 24.55 | 191.9 | 91 | 8.64 | 14.1 | 6 | 3.81 | 2 | False |
| 3331 | CT | 184 | 510 | 364-6381 | yes | no | 0 | 213.8 | 105 | 36.35 | 159.6 | 84 | 13.57 | 139.2 | 137 | 6.26 | 5.0 | 10 | 1.35 | 2 | False |
| 3332 | TN | 74 | 415 | 400-4344 | no | yes | 25 | 234.4 | 113 | 39.85 | 265.9 | 82 | 22.60 | 241.4 | 77 | 10.86 | 13.7 | 4 | 3.70 | 0 | False |